A Preliminary Investigation of Overfitting in Evolutionary Driven Model Induction: Implications for Financial Modelling

نویسندگان

  • Clíodhna Tuite
  • Alexandros Agapitos
  • Michael O'Neill
  • Anthony Brabazon
چکیده

This paper investigates the effects of early stopping as a method to counteract overfitting in evolutionary data modelling using Genetic Programming. Early stopping has been proposed as a method to avoid model overtraining, which has been shown to lead to a significant degradation of out-of-sample performance. If we assume some sort of performance metric maximisation, the most widely used early training stopping criterion is the moment within the learning process that an unbiased estimate of the performance of the model begins to decrease after a strictly monotonic increase through the earlier learning iterations. We are conducting an initial investigation on the effects of early stopping in the performance of Genetic Programming in symbolic regression and financial modelling. Empirical results suggest that early stopping using the above criterion increases the extrapolation abilities of symbolic regression models, but is by no means the optimal training-stopping criterion in the case of a real-world financial dataset.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Approach to Reducing Overfitting in FCM with Evolutionary Optimization

Fuzzy clustering methods are conveniently employed in constructing a fuzzy model of a system, but they need to tune some parameters. In this research, FCM is chosen for fuzzy clustering. Parameters such as the number of clusters and the value of fuzzifier significantly influence the extent of generalization of the fuzzy model. These two parameters require tuning to reduce the overfitting in the...

متن کامل

Fuzzy Logic Based Life Estimation of PWM Driven Induction Motors

Pulse-width modulated (PWM) adjustable frequency drives (AFDs) are extensively used in industries for control of induction motors. It has led to significant advantages in terms of the performance, size, and efficiency but the output voltage waveform no longer remains sinusoidal. Hence, overshoots, high rate of rise, harmonics and transients are observed in the voltage wave. They increase voltag...

متن کامل

Tackling Overfitting in Evolutionary-Driven Financial Model Induction

This chapter explores the issue of overfitting in grammar-based Genetic Programming. Tools such as Genetic Programming are well suited to problems in finance where we seek to learn or induce a model from the data. Models that overfit the data upon which they are trained prevent model generalisation, which is an important goal of learning algorithms. Early stopping is a technique that is frequen...

متن کامل

Selection of energy source and evolutionary stable strategies for power plants under financial intervention of government

Currently, many socially responsible governments adopt economic incentives and deterrents to manage environmental impacts of electricity suppliers. Considering the Stackelberg leadership of the government, the government’s role in the competition of power plants in an electricity market is investigated. A one-population evolutionary game model of power plants is developed to study how their pro...

متن کامل

Statistical Modelling of a Preliminary Process for Depolymerisation of Cassava Non-starch Carbohydrate Using Organic Acids and Salt

A preliminary study on statistical modelling of a process for depolymerisation of cassava non-starch carbohydrate using halide salt assisted phosphoric and pyruvic acids were accomplished. The effects of three independent variables namely; acid concentration, potassium iodide salt and duration were studied using the central composite rotatable design on hydrolysis of the cassava non-starch carb...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011